Speaker interpolation for HMM-based speech synthesis system.
نویسندگان
چکیده
منابع مشابه
Speaker interpolation in HMM-based speech synthesis system
This paper describes an approach to voice characteristics conversion for HMM-based text-to-speech synthesis system by using speaker interpolation. An HMM interpolation technique is derived from a probabilistic distance measure for HMMs, and used to synthesize speech with untrained speaker’s characteristics by interpolating HMM parameters among some representative speakers’ HMM sets. The result ...
متن کاملSpeaker-Independent HMM-based Speech Synthesis System
This paper describes an HMM-based speech synthesis system developed by the HTS working group for the Blizzard Challenge 2007. To further explore the potential of HMM-based speech synthesis, we incorporate new features in our conventional system which underpin a speaker-independent approach: speaker adaptation techniques; adaptive training for HSMMs; and full covariance modeling using the CSMAPL...
متن کاملCross-lingual Speaker Adaptation for HMM-based Speech Synthesis based on Perceptual Characteristics and Speaker Interpolation
This paper proposes a cross-lingual speaker adaptation (CLSA) method based on perceptual characteristics (PCs). To develop a CLSA system, a state mapping (SM) based method has been recently proposed. This method extracts speaker characteristics directly from source acoustic features as linear transforms and applies them to target models. However, it is difficult to completely eliminate language...
متن کاملSpeaker adaptation for HMM-based speech synthesis system using MLLR
This paper describes a voice characteristics conversion technique for an HMM-based text-to-speech synthesis system. The system uses phoneme HMMs as the speech synthesis units, and voice characteristics conversion is achieved by changing HMM parameters appropriately. To transform the voice characteristics of synthetic speech to the target speaker, we apply an MLLR (Maximum Likelihood Linear Regr...
متن کاملAnalysis of speaker clustering strategies for HMM-based speech synthesis
This paper describes a method for speaker clustering, with the application of building average voice models for speakeradaptive HMM-based speech synthesis that are a good basis for adapting to specific target speakers. Our main hypothesis is that using perceptually similar speakers to build the average voice model will be better than use unselected speakers, even if the amount of data available...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the Acoustical Society of Japan (E)
سال: 2000
ISSN: 0388-2861,2185-3509
DOI: 10.1250/ast.21.199